Server Repair Engineering Supervisor - Grapevine, TX
Grapevine, TX Direct-Hire $65000.00 - $65000.00 Onsite

Job Description

The Server Repair Engineering Supervisor at a AI Server Service Center, the position will combine responsibilities for both testing hardware/software systems (TE) and streamlining operational or manufacturing processes (PE). Below is a detailed job requirement and description tailored to this dual-role position:

Essential Duties and Responsibilities include the following. Other Duties not listed may be assigned.

Test Engineering (TE):

- Oversee a team of test engineers conducting diagnostics, validation, and troubleshooting of AI server hardware/software.

- Implement and monitor testing protocols to ensure compliance with Dell's quality standards.

- Evaluate AI servers for performance, reliability, and functionality, using advanced diagnostic tools and methodologies.

- Develop or enhance automated testing scripts and procedures for AI server systems and components.

- Collaborate with the Product Development and Quality Assurance teams to identify and resolve issues during testing and validation.

Process Engineering (PE):

- Analyze, design, and implement efficient workflows for testing, repairing, and upgrading AI server systems.

- Lead initiatives to improve speed, quality, and cost-effectiveness of processes in the service center.

- Perform root cause analysis of process failures and develop corrective/preventive action plans.

- Stay updated on industry best practices for process optimization and manufacturing engineering trends.

- Perform capacity planning to enhance throughput and scalability in testing/service center operations.

Leadership and Operational Management:

- Supervise and guide the TE and PE teams, providing mentorship and technical assistance as required.

- Plan and allocate resources, monitor progress, and ensure timely deliverables for testing and process improvement projects.

- Enforce compliance with quality and safety standards (e.g., ISO, AI internal standards) and industry regulations.

- Proactively communicate updates, challenges, and solutions to stakeholders, including management and cross-functional teams.

Education and/or Experience

-Bachelor's degree in Electrical Engineering, Industrial Engineering, Computer Science, or related field (Master's degree preferred).

-Minimum 8-10 years of relevant experience, with at least 3 years in a supervisory or leadership role in test engineering, process engineering, or a related field.

-Proven experience in AI servers, high-performance computing systems, or similar hardware/software environments is highly preferred.

Certifications (Preferred):

- EMC Proven Professional or equivalent hardware/server certifications.

-Six Sigma Green/Black Belt (for process optimization).

-Certifications related to AI/ML hardware or data management (e.g., Deep Learning Institute).

Essential Skills:

- Extensive experience with advanced diagnostic tools for understanding and troubleshooting server components, including CPUs, GPUs, memory modules, power supplies, and storage devices.

- Strong background in AI servers (e.g., PowerEdge, DGX systems) and familiarity with related software such as TensorFlow, PyTorch, or AI/ML frameworks.

-Knowledge of process optimization tools (e.g., Six Sigma, Lean Manufacturing, or Kaizen methodologies).

-Experience working with test automation frameworks and tools (e.g., Python, MATLAB, LabVIEW, or other scripting/programming languages).

-Familiarity with server management tools (e.g., iDRAC, IPMI) and operating systems such as Linux and Windows Server.

-Support high-performance computing environments with cutting-edge AI server technologies.

-Drive continuous improvement in processes to maintain commitment to high quality and operational excellence.

-Act as the point of escalation for complex technical and operational issues at the service center.

All qualified applicants will receive consideration for employment without regard to race, color, national origin, age, ancestry, religion, sex, sexual orientation, gender identity, gender expression, marital status, disability, medical condition, genetic information, pregnancy, or military or veteran status. We consider all qualified applicants, including those with criminal histories, in a manner consistent with state and local laws, including the California Fair Chance Act, City of Los Angeles' Fair Chance Initiative for Hiring Ordinance, and Los Angeles County Fair Chance Ordinance.

Job Reference: JN -032026-416876